Ab initio gene prediction for protein coding regions

نویسندگان

چکیده

Abstract Motivation Ab initio gene prediction in non-model organisms is a difficult task. While many ab methods have been developed, their average accuracy over long segments of genome, and especially when assessed wide range species, generally yields results with sensitivity specificity levels the low 60% range. A common weakness most tendency to learn patterns that are species-specific varying degrees. The need exists for extract genetic features can distinguish coding non-coding regions not sensitive specific organism characteristics. Results new method based on neural network (NN) uses collection sensors create input presented. It shown accurate predictions achieved even trained significantly different phylogenetically than test organisms. consensus algorithm CoDing Sequence (CDS) subsequently applied first nucleotide level NN boosts through data driven procedure optimizes CDS/nonCDS threshold. An aggregate benchmark at shows this approach performs better existing methods, while requiring less training data. Availability https://github.com/BioMolecularPhysicsGroup-UNCC/MachineLearning Supplementary information available Bioinformatics online.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ab Initio Protein Structure Prediction

Predicting protein 3D structures from the amino acid sequence still remains as an unsolved problem after five decades of efforts. If the target protein has a homologue already solved, the task is relatively easy and high-resolution models can be built by copying the framework of the solved structure. However, such a modelling procedure does not help answer the question of how and why a protein ...

متن کامل

Ab initio protein structure prediction using chunk-TASSER.

We have developed an ab initio protein structure prediction method called chunk-TASSER that uses ab initio folded supersecondary structure chunks of a given target as well as threading templates for obtaining contact potentials and distance restraints. The predicted chunks, selected on the basis of a new fragment comparison method, are folded by a fragment insertion method. Full-length models a...

متن کامل

Ab initio protein structure prediction: progress and prospects.

Considerable recent progress has been made in the field of ab initio protein structure prediction, as witnessed by the third Critical Assessment of Structure Prediction (CASP3). In spite of this progress, much work remains, for the field has yet to produce consistently reliable ab initio structure prediction protocols. In this work, we review the features of current ab initio protocols in an at...

متن کامل

Ab initio protein structure prediction using a combined hierarchical approach.

As part of the third Critical Assessment of Structure Prediction meeting (CASP3), we predict the three-dimensional structures for 13 proteins using a hierarchical approach. First, all possible compact conformations of a protein sequence are enumerated using a highly simplified tetrahedral lattice model. We select a large subset of these conformations using a lattice-based scoring function and b...

متن کامل

Development of an ab initio protein structure prediction system ABLE.

An ab initio protein structure prediction system called ABLE is described. It is based on the fragment assembly method, which consists of two steps: dividing a target sequence into overlapping subsequences (fragments) of short length and assigning a local structure to each fragment; and generating models by assembling the local structures and selecting the models with low potential energy. One ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Bioinformatics advances

سال: 2023

ISSN: ['2635-0041']

DOI: https://doi.org/10.1093/bioadv/vbad105